Parallel Block Matrix Factorizations for Distributed Memory Multicomputers

نویسندگان

Krister Dackland

Erik Elmroth

چکیده

EEcient and scalable parallel block algorithms for the LU factor-ization with partial pivoting, the Cholesky, and QR factorizations in a distributed memory multicomputer environment are presented. The distributed system is viewed as a ring of processors and the algorithms correspond to shared memory algorithms parallelized on block level (explicit parallelism). Performance of the algorithms are analyzed theoretically and illustrated empirically by implementations on the Intel iPSC/2 hypercube. A model predicting performance and optimal block size is presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Performance Modeling of Parallel Block Matrix Factorizations for Distributed Memory Multicomputers

EEcient and scalable parallel block algorithms for the LU factorization with partial pivoting, the Cholesky, and QR factorizations in a distributed memory multicomputer environment are presented. The distributed system is viewed as a ring of processors and the algorithms correspond to shared memory algorithms parallelized on block level (explicit parallelism). Performance of the algorithms are ...

متن کامل

A Ring-Oriented Approach for Block Matrix Factorizations on Shared and Distributed Memory Architectures

A block (column) wrap-mapping approach for design of parallel block matrix factorization algorithms that are (trans)portable over and between shared memory multiprocessors (SMM) and distributed memory multicomputers (DMM) is presented. By reorganizing the matrix on the SMM architecture, the same ring-oriented algorithms can be used on both SMM and DMM systems with all machine dependencies compr...

متن کامل

A parallel Block Lanczos algorithm and its implementation for the evaluation of some eigenvalues of large sparse symmetric matrices on multicomputers

In the present work we describe HPEC (High Performance Eigenvalues Computation), a parallel software for the evaluation of some eigenvalues of a large sparse symmetric matrix. It implements a Block Lanczos algorithm efficient and portable for distributed memory multicomputers. HPEC is based on basic linear algebra operations for sparse and dense matrices, some of which have been derived by ScaL...

متن کامل

Ring-oriented Block Matrix Factorization Algorithms for Shared and Distributed Memory Architectures

Utilizing experiences from the implementations on shared memory multiprocessors (SMM) and distributed memory multicomputers (DMM), general ring-oriented routines are developed for the LU, Cholesky, and QR factorizations. Since, all machine dependencies are comprised to a small set of communication routines, the same factorization routines can be used on both the SMM and DMM architectures. The a...

متن کامل

A Parallel Computational Kernel for Sparse Nonsymmetric Eigenvalue Problems on Multicomputers

The aim of this paper is to show an effective reorganization of the nonsymmetric block lanczos algorithm efficient, portable and scalable for multiple instructions multiple data (MIMD) distributed memory message passing architectures. Basic operations implemented here are matrix-matrix multiplications, eventually with a transposed and a sparse factor, LU factorisation and triangular systems sol...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1992

Parallel Block Matrix Factorizations for Distributed Memory Multicomputers

نویسندگان

چکیده

منابع مشابه

Design and Performance Modeling of Parallel Block Matrix Factorizations for Distributed Memory Multicomputers

A Ring-Oriented Approach for Block Matrix Factorizations on Shared and Distributed Memory Architectures

A parallel Block Lanczos algorithm and its implementation for the evaluation of some eigenvalues of large sparse symmetric matrices on multicomputers

Ring-oriented Block Matrix Factorization Algorithms for Shared and Distributed Memory Architectures

A Parallel Computational Kernel for Sparse Nonsymmetric Eigenvalue Problems on Multicomputers

عنوان ژورنال:

اشتراک گذاری